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Human cDNA clones encoding four novel transmembrane protein 
serineAhreonine kinases, denoted activin receptor-like kinase (ALK) -1, -2, -3 and -4, 
were obtained using a polymerase chain reaction (PCFty-based strategy. The PCR 
primers were designed based upon the sequence similarity between the activin 
receptor type II and Daf-1. The cDNA clones for ALK-1 , -2 and -3 encode complete 
proteins of 503, 509 and 532 amino acids, respectively. The ALK-4 cDNA is 
incomplete and the predicted protein of 383 amino acids has a truncated 
extracellular domain. The ALKs share similar domain structures, comprising of 
predicted signal sequences at the N-terminals, followed by hydrophilic cysteine-rich 
binding domains, single hydrophobic transmembrane regions and C-terminal 
intracellular portions which consist almost entirely of putative serine/threonine 
kinase domains. The ALKs have approximately 40% sequence identity to activin 
receptors type II and MB, TGF-p type II receptor and Daf-1 in the kinase domains. 
However, the sequence identities are higher (60-79%) between ALK-1 , -2, -3 and -4, 
suggesting that they form a subfamily among the receptor serineAhreonine kinases. 
The extracellular domains of ALKs show only little sequence identity to other 
receptor serine/threonine kinases, but the cysteine residues are conserved. Tht. 
structural properties suggest that ALK-1 through 4 are receptors that may bind 
ligands that are members of the TGF-p superfamily. The expression of mRNA in 
human tissues varied for the different ALKs; ALK-2 and ALK-4 showed ubiquitous 
tissue expression patterns, whereas the distribution of ALK-1 and ALK-3 varied 
strongly between different tissues with more restricted expression patterns. These 
results suggest that each ALK may have different in vivo functions. 



Introduction 

Transforming growth factor-p (TGF-p) is a family of multifunctional 
peptides of which three isoforms, termed TGF-p1, TGF-p2 and TGF-P3, have 
been identified in mammals (for reviews see Roberts and Sporn, 1990; Moses 
et al., 1990). The TGF-p family belongs to a larger superfamily of structurally 
related signalling proteins, including activins (Vale et al., 1986; Ling era/., 
1986), inhibins (Mason et al., 1985; Forage et al., 1986), Mullerian inhibiting 
substance (MIS; Cate et al., 1986), and bone morphogenetic proteins (BMPs; 
Wozney et al., 1988; Wang et al., 1988; Celeste et al., 1990) or osteogenic 
proteins (OPs; Ozkaynak era/., 1990; 1992). All these molecules regulate cell 
proliferation, differentiation and various other cell functions (for reviews see 
Roberts and Sporn, 1990; Massague, 1990). 

Members of the TGF-p superfamily exert their effects on cells via 
binding to specific receptors. The receptors for TGF-p have been most 
thoroughly characterized (for review see Massague. 1992; Lin and Lodish, 
1993). By affinity labeling with radiolabeled TGF-p, a number of putative cell 
surface receptors have been identified, including three distinct size classes 
denoted receptor type I (53 kd), type II (75 kd), type III (or betaglycan, 300 kd 
proteoglycan with a 120 kd core protein) (Massague and Like, 1985; Cheifetz 
et al., 1986) and more recently endoglin (homodimer of two 95 kd subunits) 
(Cheifetz et al., 1992). Current evidence implicates that receptors type I and 
type II are directly involved in receptor signal transduction (Segarini et al., 
1989; Boyd and Massague, 1989; Laiho era/., 1990); the receptors may 
interact and form a heteromeric complex (Laiho et al., 1991). In fact, type I 
appears to require type II to bind TGF-p and type I appears to be necessary for 
type II to signal (Wrana et al., 1992). Receptor type III and endoglin, which 
have recently been cloned (Wang et al., 1991, L6pez-Casillas et al., 1991; 
Gougos and Letarte, 1990), may have more indirect roles in signalling, 
perhaps presenting ligands to signalling receptors. 



Based upon the high structural similarity of the members of the TGF-p 
superfamily. the receptors for these ligands may have similar structural and 
functional characteristics. The TGF-p superfamily members do not compete 
with each other for receptor binding and thus appear to act through specific 
receptors to elicit different biological effects. In pituitary tumor cells, however, a 
distinct type IV TGF-p receptor, has been identified, which is also recognized 
by activin and inhibin (Cheifetz era/., 1988). Binding studies with activin A and 
BMP-4 have shown two coexisting membrane components of 50-60 and 70- 
80 kd on responsive cells (Hino et al., 1989; Paralkar era/., 1991, Attisano et 
at., 1992). In analogy with TGF-p receptors they may be signalling receptors 
and have been named receptors type I and type II. Direct evidence that 
receptors for TGF-p superfamily members indeed are structurally similar was 
provided by the recent expression cloning and sequence analysis of mouse 
activin receptor type II (mActR-ll; Mathews and Vale, 1991) and human TGF-p 
receptor type II (hTpR-ll; Lin et a/., 1992). These genes show sequence 
homology with each other and with a previously identified C. elegans putative 
receptor gene daf-1 that controls Dauer Larva development (Georgi et al., 
1990). Additionally, another structurally related activin receptor, type UB 
(ActR-IIB), was identified using a low stringency hybridization protocol 
(Mathews et al.. 1992), and using a PCR-based approach (Attisano et al., 
1992; Legerski et al., 1992). The encoded proteins are all putative 
serine/threonine kinase receptors, providing a possible mechanism for 
transmembrane signalling. 

The kinase domains of the serine/threonine kinase receptors show 
significant sequence similarity which define them as a separate kinase family. 
Autophosphorylation on serine and threonine residues of the kinase domain 
of hTpR-ll has been demonstrated in a bacterial fusion protein (Lin et al., 
1992). The extracellular domains of these receptors show little sequence 
similarity, but all are rich in cysteine residues (Lin et al., 1992). 



To identify additional members of the receptor serine/threonine kinase 
family, which may bind members of TGF-[3 superfamily, we have used a PCR- 
based approach. The strategy made use of the sequence identity between 
short stretches of conserved amino acids, in particular in the kinase domains 
of mActR-ll and Daf-1 , and resulted in the isolation of four, novel members of a 
subfamily of receptor serine/threonine kinases, denoted activin receptor-like 
kinases (ALKs). 

Results 

Identification of novel genes that encode receptor 
serine/threonine kinases 

To identify novel receptor serine/threonine kinases we used a PCR 
based approach similar to that used for the identification and cloning of novel 
tyrosine kinases (Wilks, 1989; Kamb etal., 1989; Reid era/., 1990; Partanen et 
al., 1990; 1992; Raz etal., 1991; Masiakowski and Carrol, 1992). HEL cells 
were chosen as sources of mRNA in ceverse transcription (RT)-PCR, as they 
have been shown to respond to both activin and TGF-(3 (Hino et al., 1989; 
Sing etal., 1988). 

Degenerate PCR primers were designed based upon short stretches of 
amino acid sequence identity in the mActR-ll (Mathews and Vale, 1991) and 
daf-1 gene product (Georgi et al., 1990). Three PCR primers allowed us to 
obtain PCR recombinants encoding novel amino acid sequences with 
conservation of specific amino acid residues found in ActR-ll and Daf-1 ; primer 
B3-S, derived from the kinase subdomain II (nomenclature according to 
Hanks et at., 1988), encodes the lysine residue involved in ATP binding, and 
primers B7-S and E8-AS. derived from the kinase subdomains VII and VIII, 
encode serineAheronine kinase specific residues. 



Using primers B3-S and E8-AS. the PGR recombinants derived from 
previously identified ActR-ll and TpR-ll as well as three novel PCR 
recombinants were identified, termed 11.1, 1 1.2 and 1 1.3, and using primers 
B7-S and E8-AS, four other novel PCR recombinants were identified, 
including PCR recombinant 5.2. The four receptor genes that are presented in 
this report correspond to four different PCR recombinants that were closely 
related to the activin and TGF-p* type II receptors. 

cDNA cloning and sequence analysis of ALK-1, ALK-2, ALK-3, 
and ALK-4 

To clone cDNA for ALK-1 we screened an oligo(dT) primed human 
placenta cDNA library with a radiolabeled insert derived from the PCR 
recombinant 1 1.3. Three different types of clones were identified, based upon 
their EcoRI restriction enzyme pattern. Many positive clones of one class with 
an approximate insert size of 2 kb, were isolated in the library screening, 
which are likely to correspond to the 2.2 kb transcript size seen in Northern 
blots probed for ALK-1 (see Figure 4A). The cDNA clone HP 57 was chosen 
as representative of this class and subjected to complete sequencing. Of the 
other two types only one clone was isolated with insert sizes of 1.7 kb and 3.5 
kb, respectively. The latter cDNA clone may correspond to the longer ALK-1 
transcript seen in Northern blots (data not shown). 

Sequence analysis of ALK-1 (cDNA clone HP 57) revealed a sequence 
of 1984 nucleotides including a poly A tail (Figure 1A). The longest open 
reading frame encodes a protein of 503 amino acids, with high sequence 
identity to receptor serineAhreonine kinases (see below). The first methionine 
codon, the putative translation start site, is at nucleotides 283-285 and is 
proceeded by an in-frame stop codon. This first ATG is in a more favorable 
context for translation initiation (Kozak, 1987) than the second and third in- 
frame ATG at nucleotides 316-318 and 325-327. The putative initiation codon 



is preceeded by a 5' untranslated sequence of 282 nucleotides that is GC-rich 
(80 % GC), which is not uncommon for growth factor receptors (Kozak, 1991). 
The 3" untranslated sequence comprises 193 nucleotides and ends with a 
poly A tail. No bona fide poly A addition signal is found, but there is a 
sequence AATACA 17-22 nucleotides upstream of the poly A tail which could 
possibly serve as a poly A addition signal. 

ALK-2 cDNA was cloned by screening an amplified oligo(dT) primed 
human placenta cDNA library with a radiolabeled insert derived from the 
recombinant 11.2. Two clones termed HP 53 and HP64 with insert sizes of 2.7 
kb and 2.4 kb, respectively, were identified, and their sequences were 
determined. We found no sequence difference in the overlapping clones and 
conclude that both are derived from transcripts of the same gene. 

Sequence analysis of cDNA clone HP 53 for ALK-2 (Figure 1B) 
revealed a sequence of 2719 nucleotides with a poly A tail. The longest open 
reading frame encodes a protein of 509 amino acids. The first ATG at 
nucleotides 104-106 agrees favorable with Kozak's consensus sequence with 
an A at position -3. This ATG is preceded in-frame by a stop codon. There are 
four ATG codons in close proximity further downstream, which agree with the 
Kozak's consensus sequence (Kozak, 1987), but according to Kozak's 
scanning model the first ATG is predicted to be the translation start site 
{Kozak, 1987). The 5' untranslated sequence is 103 nucleotides. The 3' 
untranslated sequence of 1089 nucleotides contains a polyadenylation signal 
located 9-14 nucleotides upstream from the poly A tail. The cDNA clone HP 64 
lacks 498 nucleotides from 5' end compared to HP 53, but the sequence 
extended at the 3' end with 190 nucleotides and poly A tail is absent (data not 
shown). This suggests that different polyadenylation sites occur for ALK-2. In 
Northern blots, however, only one transcript was detected (see below). 

The cDNA for ALK-3 was cloned by initially screening an oligo(dT) 
primed human foreskin fibroblast cDNA library with an oligonucleotide derived 



from the PCR recombinant 5.2. One positive cDNA clone with an insert size of 
3 kb, ON11, was identified. Upon partial sequencing, it appeared that this 
clone was incomplete; it encodes only part of the kinase domain and lacks the 
extracellular domain. Sequences corresponding to the truncated kinase 
domain were subsequently used to probe a random primed fibroblast cDNA 
library from which one cDNA clone with an insert size of 3 kb, termed ONF5 
(Figure 1C), was isolated. Sequence analysis of ONF5 revealed a sequence 
of 2932 nucleotides without poly A tail, suggesting that this clone was derived 
by internal priming. The longest open reading frame codes for a protein of 532 
amino acids. The first ATG codon is compatible with Kozak's consensus 
sequence (Kozak, 1987), is at 310-312 nucleotides and is preceeded by an 
in-frame stop codon. The 5' and 3' untranslated sequences are 309 and 1027 
nucleotides long, respectively. 

ALK-4 cDNA was identified by screening a human oligo(dT) primed 
human erythroleukemia cDNA library with the PCR recombinant 11.1 as a 
probe. One cDNA clone, termed 11H8, was identified with an insert size of 2 
kb (Figure 1 D). An open reading frame was found encoding a protein 
sequence of 383 amino acids with high similarity to receptor serine/threonine 
kinases; however, the extracellular domain was truncated, indicating that the 
isolated cDNA was incomplete. The 3' untranslated sequence is 818 
nucleotides and does not contain a poly A tail, suggesting that the cDNA was 
internally primed. 

Comparison of the structures of ALKs 

ALK-1, ALK-2 and ALK-3 have a similar domain structure (Figure 2). 
Hydrophilic, relatively short (97 to 129 amino acids) putative extracellular 
domains are flanked by hydrophobic leader sequences at the N-terminais 
(von Heijne, 1986), and by putative single transmembrane domains (Kyte and 
Doolittle, 1982), which are followed by cytoplasmic domains at the C- 



terminals. The putative ligand binding domains of ALKs share little sequence 
identity (15-20%) but each contains 10 conserved cysteine residues (Figure 
2). Thus, it is likely that their three dimensional structures are similar. Each of 
the ALKs has one potential N-linked glycosylation site; its position is 
conserved in ALK-1 and ALK-2, but not in ALK-3. The intracellular portions of 
ALK-1, -2, -3 and -4 consist almost entirely of kinase domains that have 60- 
79% sequence identity (Figure 2 and 3A). Based upon the sequence 
similarity between activin and TGF-p type II receptors, the C-termini of the 
kinase domains of ALK-1, ALK-2, ALK-3 and ALK-4 are set at Ser-495, Ser- 
501, Ser-527 and Gln-378, respectively. A consensus sequence for the 
binding of ATP (Gly-X-X-Gly-X-X-Gly followed by Lys further downstream in 
subdomain II) is found in four ALKs. Analysis of the amino acid sequences in 
subdomains VI and VIII, that are most useful to distinguish the specificity of 
amino acid phosphorylation, predicts that ALKs are serine/threonine kinases 
(see discussion). Two kinase inserts are present between kinase subdomains 
VIA and VIB, and between subdomains X and XI, respectively (Figure 2). The 
regions of highest divergence in the intracellular parts of ALKs are in the 
juxtamembrane segments (Figure 2). 9 

Phylogenetic relationship of receptor serine/threonine kinases 

A search of the individual ALKs in the database identifies ActR-ll, ActR- 
IIB, Tp*R-ll and Daf-1 as the closest homologues of ALKs, with which they 
share approximately 40% amino acid identity (Figure 3A). 

The sequence of an additional member of the receptor serine/threonine 
kinase family, ALK-5, is presented elsewhere (Franzen et al., submitted for 
publication). ALK-5 and ALK-4 have 90% sequence identity in their kinase 
domains (Figure 3A). In comparison, ActR-ll and ActR-IIB, which both bind 
activin, have a sequence identity in their kinase domains of 78%. Pairwise 
comparison of members of the serine/threonine kinase family of receptors 



using the Jutun-Hein sequence alignment program (Hein, 1990), identifies 
ALK-1 through -5 as a separate subclass among the receptor serine/threonine 
kinases (Figure 3B). 

mRNA expression of ALK-1 through -4 

The distribution of ALK-1, -2, -3 and -4 in various human tissues (heart, 
brain, placenta, lung, liver, skeletal muscle, kidney and pancreas) was 
determined by Northern blot analysis of mRNA (Figure 4). In order to minimize 
cross-hybridization, probes were used that did not encode part of the kinase 
domains, but corresponded to the highly diverged sequences of either 5' 
untranslated and ligand binding regions (probes for ALK-1, -2, and -3) or 3' 
untranslated sequences (probe for ALK-4). 

Using the probe for ALK-1, two trancripts of 2.2 and 4.9 kb were 
detected (Figure 4A). The ALK-1 expression level varied strongly between 
different tissues; high in placenta and lung, moderate in heart, muscle and 
kidney, and low (to not detectable) in brain, liver and pancreas. The relative 
ratios between the two transcripts were similar in most tissues; in kidney, 
however, there was relatively more ©f the 4.9 kb transcript. By reprobing the 
blot with mRNA from different tissues using a probe for ALK-2, one transcript of 
4.0 kb was detected (Figure 4B), with a ubiquitous expression pattern. 
Expression was detected in every tissue investigated and was highest in 
placenta and skeletal muscle. Subsequently, the multiple tissue blot was 
reprobed with a probe for ALK-3. One major transcript of 4.4 kb and a minor 
transcript of 7.9 kb were detected (Figure 4C). Expression was high in skeletal 
muscle, in which also an additional minor transcript of 10 kb was observed. 
Moderate levels of ALK-3 mRNA were detected in heart, placenta, kidney and 
pancreas, and low (to not detectable) expression was found in brain, lung and 
liver. The relative ratios between the different trancripts were similar in the 
tested tissues with the 4.4 kb transcript being the predominant one, with the 



exception for brain where both transcripts were expressed at a similar level. 
Probing the blot with ALK-4 indicated the presence of a transcript with the 
estimated size of 5.2 kb, and revealed an ubiquitous expression pattern 
(Figure 4D). 

Discussion 

Cellular proliferation and differentiation are controlled, in part, by a 
complex network of interacting growth factors. A large number of growth 
factors mediate their effects by binding to cell surface receptors that possess 
intrinsic tigand-activated tyrosine kinase activities (for review see, Ullrich and 
Schlessinger, 1990). Another family of transmembrane receptors for growth 
regulatory factors with serine/threonine kinase domains has recently been 
found and is rapidly growing. The present study was initiated to clone 
additional members of this receptor family that may bind ligands of the TGF-p 
superfamily. Here we describe the cloning and sequence analysis of four 
novel human receptors, ALK-1, -2, -3, and -4, which together with ALK-5 
(Franzen et al.. submitted), ActR-il, ActR-llB and TpR-ll, constitute the 
mammalian receptor serine/threonine kinase family. 

The ALKs share a similar domain structure, i.e. relatively short 
extracellular domains, transmembrane domains and cytoplasmic protein 
kinase domains. The extracellular domains share little sequence identity, but 
contain conserved cysteine residues. In the intracellular portion the kinase 
domain of ALKs share a high (60-79%) sequence identity and juxtamembrane 
segments are most dissimilar. The significantly higher sequence identities 
between the kinase domains of ALKs compared with the identities between 
ALKs and other kinases defines ALKs as a separate subfamily among the 
serine/threonine kinases. A characteristic for ALKs, shared with ActR-ll, TGR-II 
and Daf-I, is that the protein kinase domain is interrupted by two short kinase 



inserts between subdomains VIA and VIB, and subdomains X and XI. ActR-ll, 
TPR-II and Daf-1 have C-terminal tails of 23-28 amino acids that are rich in 
serine and threonine residues and could function as autophosphorylation 
sites. ALKs, however, have relatively short C-terminal tails of only 5 to 8 amino 
acid residues, which lack serine and threonine residues exept for one 
threonine residue in ALK-2. Comparison of ALKs with other mammalian 
receptor serine/threonine kinases shows that the regions that diverge most in 
the intracellular domains are in the juxtamembrane segments, kinase inserts 
and C-terminal tails. If these regions, in analogy with the situation that prevail 
for tyrosine kinases, are important for interaction with down-stream 
components in the signal transduction pathway, then ALKs as a group may 
share such components, but the specificity is likely to be different for other 
receptor serine/threonine kinases. 

The residues of ALKs in the kinase subdomain VIB and VIII predict that 
they are serine/threonine kinases. The sequence motifs HRDLKSKN 
(subdomain VIB) and GTKRYMAPE (subdomain VIII) are highly conserved in 
ALKs, ActR-ll. ActR-IIB, T(3R-II and Daf-1 and other serine/threonine kinases, 
but not in tyrosine kinases. In tyrosine kinases tyrosine residue(s) surrounded 
by acidic amino acids are found between subdomains VII and VIII. In ALK-1, 
but not in other ALKs. a tyrosine residue is present at position 361 and is 
flanked by two aspartic acid residues. Of note, purified mActR-IIB was 
recently shown to be a dual kinase which phosphorylates itself as well as 
exogenous substrates on serine, threonine and tyrosine residues (Nakamura 
et al, 1992). Our current work is aimed at verifying that ALKs have kinase 
activity, and determine whether serine, threonine or tyrosine residues are 
phosphorylated. 

The high sequence identity between ALKs and ActR-ll and TpR-ll 
suggests that ALKs may also function as signalling type II receptors for 
members of the TGF-p* superfamiiy. However, the molecular masses of the 



mature ALK-1, -2 and -3 proteins are 53.9 kd. 55.7 kd and 57.5 kd. 
respectively; thus the possibility that members of the ALK family are type I 
receptors for TGF-6 superfamiiy member, has not been excluded. 

Whereas the expressions of ALK-2, ALK-4 and ALK-5 are ubiquitous, 
the expressions of ALK-1 and ALK-3 show more restricted patterns (Figure 4 
and Franzen et a!., manuscript submitted). The expression profile of each 
ALKs is different. Thus, it is likely that the different ALKs serve different 
functions. The accumulated data support the notion that growth regulatory 
factors act preferentially in autocrine and/or paracrine manners. If this is 
correct for the ligands for ALKs, ALK-2 and ALK-4 may bind to ligands with 
effects on many tissue types, exemplified by TGF-p\ ALK-1, predominantly 
expressed in placenta and lung, and ALK-3, highly expressed in muscle, may 
bind to ligands with more restricted action range, exemplified by MIS. Further 
studies are in progress to identify the ligands for ALKs. This involves a 
detailed characterization of the expression patterns of ALKs to allow a 
comparison with those of possible ligands, as well as direct binding 
experiments. 

All detected transcript sizes were different, and thus no cross-reaction 
between mRNAs for the different ALKs was observed when the specific 
probes were used. This suggests that the multiple transcripts of ALK-1 and 
ALK-3 are coded from the same gene. The mechanism for generation of the 
different transcripts is unknown at present; they may be formed by alternative 
mRNA splicing, different polyadenylation, use of different promoters, or by a 
combination of these events. The functional significance of the presence of 
different transcripts also remains to be elucidated. Difference in mRNA 
splicing in the regions coding for the extracellular domains may lead to the 
synthesis of receptors with different affinities for ligands, as was shown for 
mActR-IIB (Attisano etal., 1992), or the production of soluble binding proteins. 
Differences in the intracellular domains may lead to truncated cell surface 



receptors that may possibly act in a dominant negative fashion, or act as 
receptors with different enzymatic activity or substrate specificity. 

Many TGF-p superfamily members and their receptors appear to have 
important functions during early development in mediating the induction and 
pattern formation of mesodermal tissues. The injection of mRNA for ActR-ll 
(Kondo etai., 1991) and ActR-IIB (Mathews eta!., 1992) leads to embryos that 
display developmental defects characterized by inappropriate formation of 
dorsal mesodermal tissue. ALKs may also have important roles during the 
development. Thus, it will be important to determine their expression in 
different tissues during different developmental stages, and to measure their 
effects on mesoderm formation after injection of their mRNA into Xenopus 
embryos. 

Our experimental strategy appears to have selectively amplified 
sequences encoding parts of putative receptor serine/threonine kinase genes. 
Thus far no genes which encode cytoplasmic serine/threonine kinases were 
picked up. Since about 20 members of TGF-p superfamily have been 
identified, it is anticipated that there are more members of the receptor 
serine/threonine kinase family, rt is also possible that some of the 
serine/threonine kinases may bind other proteins unrelated to the members of 
the TGF-p superfamily, e.g. members of Wnt protein family. The novel genes 
we identified by PCR were all derived from mRNA from HEL cells. It is likely 
that use of other mRNA sources, in combination with the same or similar PCR 
primers, will lead to the identification of additional members in this receptor 
family. 

Materials and methods 



Nucleic acids 



RNA was prepared from HEL cells using the guanidium isothiocyanate 
method (Chirgwin et al.< 1979). Poly(A+) RNA was selected by polyAT tract 
system (Promega) as described by the manufacturers or purified through 
oligo(dT)-cellulose column as described (Aviv and Leder, 1972). 
Oligonucleotides were synthesized with a Gene assembler plus (Pharmacia- 
LKB) using phosphoramidite technique, deprotected in 30% ammonia for 2 hr 
at 80°C, and then desalted on NAP-10 columns (Pharmacia-LKB). 

RT-PCR 

2 u.g of HEL mRNA was reverse transcribed into cDNA in the presence 
of 50 mM Tris-HCI. pH 8.3, 8 mM magnesium chloride, 30 mM potassium 
cloride. 10 mM dithiothreitol, 2 mM nucleotide triphosphates, excess oligo(dT) 
primers and 34 units of AMV reverse transcriptase at 42°C for 2 hr in 40 u.l of 
reaction volume. For amplification of the mActR-ll/Daf-1 like sequences three 
different primers were used. The primers were designed so that each primer 
set had a similar GC content, and so that self complementarity and 
complementarity between the 3' ends of the primers were avoided. The 
degeneracy of the primers was kep't as low as possible, by avoiding serine, 
leucine and arginine residues, which have 6 possible codons, and applying 
human codon preference. Degeneracy was particularly avoided at the 3* end, 
since unlike the 5' end where mismatches are tolerated, mismatches at the 3' 
end may dramatically reduce the efficiency of PCR. In order to facilitate 
directional subcloning, sequences were included at the 5' end that lead to 
restriction sites in the PCR products; GC clamps were then included at the 
most 5' end to permit efficient restriction enzyme digestion. The B3-S primer is 
a 25-mer sense oligonucleotide containing a SamHI site at a 5' end with a 
162-fold degeneracy (5'- GCGGATCCGT(CGT)GC(ACT)GT(CGT)AA(AG)AT 
(TCA)TT-3'), derived from the conserved sequence motif "VAVKIF" in the 
kinase subdomain II. The B7-S primer is a 24-mer sense oligonucleotide 



containing a BamH\ site at the 5' end with a 288-fold degeneracy (5'- 
GCGGATCCG(CAG)GA(TC)AT(TCA)AA(AG)(AT)(GC)(CT)AA-3 , ) 1 derived from 
the "RD1KSKN" sequence motif in the kinase subdomain VII. The E8-AS 
primer is a 20-mer antisense oligonucleotide containing an EcoRI site at the 5' 
end with a degeneracy of 18-fold (5'-CGGAATTC(TGA)GG(TGA)GCCAT 
(AG)TA-3') ( derived from the "YMAPE" sequence motif from the kinase domain 
VIII. Amplification by PCR was carried out with a 7.5% aliquot (3 u.l) of the 
reverse transcribed mRNA, in the presence of 10 mM Tris-HCI, pH 8.3, 50 mM 
potassium chloride, 1.5 mM magnesium chloride, 0.01% gelatin, 0.2 mM 
nucleotide triphosphates, 1 nM of both sense and antisense primers and 2.5 
units of Taq polymerase (Perkin Elmer Cetus) in 100 \x\ reaction volume. 
Amplifications were performed on a thermal cycler (Perkin Elmer Cetus) using 
the following program: first 5 cycles with denaturation for 1 min at 94°C, 
annealing for 1 min at 50°C, a 2 min ramp to 55°C and elongation for 1 min at 
72°C, followed by 20 cycles of 1 min at 94°C, 30 sec at 55°C and 1 min at 
72°C. Thereafter, a 3% aliquot (3 u,l) of the material of the first reaction was 
taken and processed for a second round of PCR amplification; 25 cycles of 1 
min at 94°C, 30 sec at 55°C, and l.min at 72°C. Samples were digested with 
BamH\ and EcoRI and subsequently fractionated by low melting point agarose 
gel electrophoresis. Bands corresponding to the approximate expected sizes 
(460 bp for primer pair B3-S and E8-AS and 140 bp for primer pair B7-S and 
E8-AS) were excized from the gel and DNA was purified. Subsequently, these 
fragments were ligated into pUC!9, that had been previously linearized with 
BamH\ and EcoRI and transformed into E. coli strain DH5a using standard 
protocols (Sambrook et al., 1989). 

cDNA libraries 

Poly(A+) RNA was isolated from human term placenta. Subsequently, 
the poly(A+)RNA was converted into cDNA using MLV reverse transcriptase 




(Superscript, BRL) and used to generate an oligo(dT) primed human placenta 
XZAP II cDNA library of 5 x 10 s independent clones, which was amplified. 
Poly(A+) RNA isolated from AG 1518 human foreskin fibroblasts was used to 
prepare a primary random primed XZAP II cDNA library of 1.5 x 10 6 
independent clones using Ribo Clone cDNA synthesis system (Promega) and 
Gigapack Gold II packaging extract (Statagene). A primary oligo(dT) primed 
human foreskin fibroblast XgtlO cDNA library (Claesson-Welsh et aJ., 1989) 
was generously provided by Lena Claesson-Welsh. An amplified oligo(dT) 
primed HEL cell Xgt11 cDNA library of 1.5 x 10 6 independent clones (Poncz et 
at., 1987) was generously provided by Mortimer Poncz and Kari Alitalo. 

Isolation of cDNA clones 

Employing RT-PCR on HEL mRNA with the primer pair B3-S and E8- 
AS, three PCR recombinants were obtained, termed 11.1, 1 1 .2 and 1 1 .3, that 
corresponded to novel genes. Using the primer pair B7-S and E8-AS, four 
additional novel PCR recombinants were obtained of which one termed PCR 
recombinant 5.2 is presented in this report. For ALK-1 , the placenta cDNA 
library was screened with radiolabeled insert of PCR recombinant 1 1.3. Based 
upon their restriction enzyme digestion pattern, three different types of clones 
with approximate insert sizes of 1.7 kb, 2 kb and 3.5 kb were identified. Clone 
HP 57 with an insert size of 2 kb was chosen for complete sequencing. For 
ALK-2 the placenta library was screened with the PCR product 1 1 .2. Two 
clones, termed HP53 and HP64 were identified with insert sizes of 2.7 kb and 
2.4 kb, respectively, and their DNA sequences were determined. For ALK-3, 
the oligo(dT) primed fibroblast cDNA library was screened with an 
oligonucleotide (5'-ATTCAAGGGCACATCAACTTCATTTGTGTCACTGTTG-3') 
derived from the PCR recombinant 5.2. One partial clone, termed ON1 1 , with 
an insert size of 3 kb, was isolated and partially sequenced. The most 5' 
sequences, a 540 nucleotides Xoal restriction fragment encoding a truncated 



kinase domain, was used to screen the random primed fibroblast cDNA 
library. One clone, ONF5, with an insert size of 3 kb was obtained, and its 
sequence was determined. For ALK-4, the oligo(dT) primed HEL cell cDNA 
library was screened with the radiolabeled insert of the PCR recombinant 11.1 
as a probe. One positive clone with an insert size of 2 kb was obtained and its 
sequence was determined. Hybridization and purification of positive 
bacteriophages were performed as described previously (Kanzaki ef a/., 
1990). Labeling of the inserts of PCR recombinants was performed using 
random priming method (Feinberg and Vogelstein, 1983) and oligonucleotide 
derived from the sequence of the PCR recombinant 5.2 was labeled by 
phosphorylation with T4 polynucleotide kinase following standard protocols 
(Sambrook era/., 1989). 

DNA sequencing and sequence analysis 

Double stranded DNA was sequenced by the dideoxy chain termination 
method (Sanger era/., 1977) using T7 DNA polymerase (Pharmacia-LKB). 
Sequencing of plasmid inserts was performed using M13 universal, M13 
reverse as well as specific prirgers and unidirectional deletion clones 
generated by Exonuclease III digestion (Henikoff.1984). Compressions were 
resolved using 7-deaza-GTP (U.S. Biochemical Corp.). DNA sequences were 
analysed using DNA STAR computer program (DNA STAR). 

Northern blot analysis 

RNA blots were hybridized with 32 P-labeled probes at 42°C overnight in 
50% formamide, 5 x standard saline citrate (SSC; 1 x SSC is 50 mM sodium 
citrate. pH 7.0, 150 mM sodium chloride), 0.1% sodium dodecyl sulfate (SDS), 
50 mM sodium phosphate and 0.1 mg/ml salmon sperm DNA. The probes 
were labeled by random priming (Feinberg and Vogelstein, 1983) and 
unincorporated label was removed by Sephadex G-25 chromatography. 



Filters were washed at 65°C. two times for 30 min in 2.5 x SSC. 0.1% SDS 
and two times for 30 min in 0.3 x SSC , 0.1% SDS. Stripping of the blots was 
performed by incubation at 90°C - 100°C in water for 20 min. 
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Figure legends 



Fig. 1. Nucleotide and deduced amino acid sequences of cDNA's for ALK-1 
(A), ALK-2 (B), ALK-3 (C) and ALK-4 <D). For each of the ALKs the following 
features are indicated; the N-terminal hydrophobic signal sequences as 
determined by the von Heijne algorithm (von Heijne, 1986) and the putative 
transmembrane sequences are overlined (thin lines); the sites for potential re- 
linked glycosylation are overlined (thick lines), cysteine residues found in the 
extracellular domain are boxed and the borders of the kinase domains are 
marked by arrows. The regions of the primers that lead to the identification of 
the sequences are indicated by half arrows. The stop codons which end the 
open reading frame are marked by asterisks. The putative polyadenylation 
signal AATAAA, if present, is underlined. The nucleotide sequences are 
numbered from the 5' ends of the cDNA clones and protein sequences are 
numbered starting from the N-termini. Both are indicated at the right of each 
lane. 

Fig. 2. Comparison of human amino % acid sequences of ActR-ll (Donaldson et 
al., 1992; Matzuk and Bradley, 1992), TpR-ll (Lin et at., 1992) and ALK-1 
through -4. Identical amino acids are boxed and conserved cysteine residues 
in the extracellular domains are indicated with dots. Putative transmembrane 
regions are double overlined. The borders of the kinase domains are 
indicated by arrows. Kinase subdomains are indicated with roman numerals, 
according to the nomenclature of Hanks et al. (1988). Alignment was 
performed using the Clustal V alignment program (Higgins and Sharp, 1989), 
with some manual adjustment. 

Fig. 3. Comparison of kinase domains of serine/threonine kinases. (A) 
Percent amino acid identity of the kinase domains of ALKs, hActR-ll 



(Donaldson et a/., 1992; Matzuk and Bradley. 1992), mActR-IIB (Attisano etal., 
1992). hTpR-ll (Lin etal., 1992) and Daf-1 (Georgi etal., 1990). The percent 
identity was calculated using Jotun-Hein alignment program (Hein, 1990). (B) 
Pairwise alignment relationship between the kinase domain of the receptor 
serine/threonine kinases, the dendrogram was generated using the Jotun- 
Hein alignment program (Hein. 1990). 

Fig. 4. Northern blot analysis of ALJCs. A multiple human tissue blot (Clontech) 
was hybridized with probes for ALK-1 (Sfi\-BstX\ restriction fragment, 
corresponding to nucleotides 288 to 670 of ALK-1 cDNA), ALK-2 (EcoR\-Ava\ 
restriction fragment, including nucleotides 1 to 581 of ALK-2 cDNA), ALK-3 
(EcoR\-Pvu\\ restriction fragment, corresponding to nucleotides 79 to 824 of 
ALK-3 cDNA) and ALK-4 (Sac\-EcoR\ restriction fragment, including 
nucleotides 1178 to 1967 of ALK-4 cDNA). Size markers are indicated on the 
left. 
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GATCTCCIGCCCAATGACCCCACCTTTCACCACATCAACAACCICCICTGICTCCAICACCACACCCCCACCArcCCIAi 



CCCCTACATCCCACCCCACGTCCTCCACCACCACATCCCC 1110 



CC TGAA TGGC A TCG TGGAGGAC TATAGACCACCCTTCTAT I 
V N C I VEOYRPPFT 

XTCCCTCCACACCCCCTCCTCTCAGCCCrACCTCAG 1 



ATCATGCCGCACTCCTCGTACCCAAACCCCTCTCCCCCACTCACCGCCCTGCCCATCAACAACACACTACAAAAAATTACCAACACTCCACACAACCCTAAACTGAirCAAlACCCCAG- I 
RHRECVYPNPSARLTALR IKKTLOK I ^ N s I P E K P « v , o . 

ACCACCTGAITCCTTTCTCCCTCCACGGGGCTCCCCGGCTCCGGCCCACTCCAICCICCCCIATCTCCGIAGACGIACTCTGACICTCCTCTCTCCTCCCCAICGGCACCTCCCCC7CCC 1 
rGCTCGCCCCCCACCCCACCCACCCAAAAATACAGCTCCCCTCAAACCTCAAAAAAAAAAAAAA 



"'CCACTCTCCCCACATCrCACCAACACCCTCCATTAACrrCTACA, 
C A TCAAA TGCGA TCGTTCTACGAC TA TCTirAmr *r t » /■ MSST QLW(.ITht 28 s 

- « . . . l .vc^iTrr. , rrTT n LTrr^ , " , "« re, ^"«-*'«« « .™ 

A T CC T AA A ACA CA TC TCC CC A TT AAC TC AC TCCC T T T<^ A TACCTTTC AC A ACTCTCC TAG AC AC ^CCCAr^ ^CCACCAAC ACACACAA , 918 

^""TTCAATTTCCCAAAACAAAATCTAAICTCACACTTTGCTGCATTTTACACAT^ C CA^cI La\t, CTTC0CTC " T4CTCC4CTCCTCTCICTTT « iT * 2,SB 

™™»"^^ 22 78 

rirri^ ^ "98 

CACCCA,TATATTATCCATTTACAACCAA«Am C A C TACAArTTTACT^ 25,8 

aaaataactattttcttttaatctacittttctatttactacttatttctataa^ttaaataaac^cttt^ »« 




JCTCCCCCCCCACCCCtCC*CCAICCCIICCC;CCCCTCCCCACIIAT&AAAAI*TCCAIC»CTTT4AT*CTCtCTTCCA*riC*ICACJktCCA*CC*tACCTC*A*CCTCITICCAC** \2Q 

AAICACAACTACACITTTATCTACCCACATCTICCACCACICCTAACAAACCACICCCACITCAACTCATICICAAClCCTTCCCAICITTIACAACAAAATCTCACrCAATCAIACTCA 2<0 

TT T AAATTGCTCAACT ACC A ACACCAA T TA T T AAAGCTCAC ACT AC AC ACCAAAC A T T AC A AT TCA AC A A TCAC TC ACC TATACAITTAC A TC ACA TTATTCCGACCCTATT TCT TC A TC 360 

~H T Q L T i T [ R L L G A Y L F T 17 

AT T TC TCCTCT TCA ACCA C ACAA TC TCCA T ACT A TCC T TC ATCCC AC TCCCATCAA ATC AC AC TCCGACC ACAA AA ACT C ACAA AA TCCACT AACC T T ACC ACC ACACCA TACCTTCCCT « 8 o 

ISR¥QCQML0Sf»LHCTCf1KS0S0QKKSCMCVTLAP£ 0TUP 57 

TTTTTAAACTCCTATTGCTCACGCCACTCTCCACATGATCCTATT AATAACACA TCCATAACTAATCCACATTCCTTTCCCATCATACAACAACATCACCACCCACAAACCACATTACCT 8O0 

r l <0r[c]s c h[c]i> o o » i k » i[t]i tncm(T]fa i i e e d o a c e t t l a 97 

TCAGCCTCTATCAAATATCAACCATCTGATTTTCAGTCCAAAGATTCTCCAAAAGCCCACCTACGCCGGACAATAGAATGTTCTCCGACC/ATTTATGTAACCACTATTTCC AACCCACA 720 

SC[C]"*TECS0FQ[F1>C0SPKAQLRRT IE 00 RTML(T]MQTt.QPT 137 

CTCCCCCCTCTTCTCATACCTCCCTTTTTTCATCCCACCATTCCA TCCCTCGTTTTGC TCATTTCTATGGCTCTCTCCATAATTCCTATGATCATCTTCTCCAGCTCCTTTTGTTAC AAA 610 

LPPYYICPFFOCS IRW L YLLISHAVCI IAHI IFSSCFCYK 177 

CATT ATTCC AAGAGC ATC TC A AGCACACCTCCTTAC AATCCTCATT TCCAACAGCATCA ACC AT TT A T TCC ACT TGCAGAATC AC TAAAACACC T T AT TCACC AGTC AC AA ACT TC TGCT 960 

HICKS I 3 SRR RYNRO LCQDEAF I PVCE SLKOL I 00 SO S SC 2 1 7 

ACTGCCTCTCCACTACCTTTATTCCTTCACCCAACTATTCCCAAACAGATTCACATCCTCCCGCAACTTCCTAAACCCCCATATCCACAJkCTATCCATCCCCAAATCCCCTCCCCAAAAA 1080 

SCSCUPLLVQRTIAKOI 0 | W V Jt QVCKCR YCE YVflCKVR C E K 257 

CTCCCCCTCAAACTATTCTTTACCACTCAACAAGCCACCTCCTTTCCACAAACAGAAATCTACCAAACTCTGCTAATCCCCCATCAAAACAIACTTCCTTTCATAGCGCCACACATTAAA 1200 

VA»KVFFTTEEASWFRETEIYOTVLflRMENIUCFIAAOIK 297 

CCTACAGGTTCCTGCACTCACCTCTATTTCATTACTCATTACCATGAAAATGCATCTCTCTATGACTTCCTCAAATGTGCTACACTCCACACCACACCCCTCCTTAAATTCCCTTATTCA 1320 

CTCSWTOLYLITOYHEMCSLYOFL'CCATLOTRAULKLAYS 337 

CCTGCCTGTCGTC TCTCCC ACC TCCAC AC ACAAA T TT A TGGC ACCC A AGGAAAGCCCGCAA T TGCTC A TCCAGACC T AA ACACCAA AA AC A TCC TC A TC A ACA A AA A TCGGAGT TGC T CC M40 

AACCLCHLMTEIYGTQCKPA I AH R 0 L K S K H ILIKKMGSCC 377 

AT TCC TCACC TCCCCCTTCCTCTTAAATTCAACACTCACACAAATCAACTTCATCTCCCCTTCAAT ACC ACCGTCGGCACCAAACGCT ACA TCCCTCCCCAACTGCTCGACCAAAGCCTC 1560 

IAOLCLAVKFNSDTNEVOVPLNTRVCTKR T H A P E V L 0 E S L 417 

AAC A AAA ACC AC TTCC ACCCC T AC ATC ATCGCTCAC ATC T AC ACC TTCCCCCTAATC AT T TCCCACA TGCC TCCTCCT TCTA TC AC ACCACCCA TCC TCCA ACA AT ACC AA TTCCCATAT 1680 

HKMHFQPY IHAOIYSFGLI I V £ fl A R R C ITCGI YEEYOLPY 457 

TACAACATCCTACCCACTCATCCCTCATACCAACATATGCCTCACCTTCTCTCTCTCAAACCTTTCCCCCCAATTCTCTCTAATCCCTGCAACACTCATCAATCTCTACCACCACTTTTC 1800 

TNNVPSOPSYCOftRevvCVKRLRP (Y5NRWMS0ECLRAYL 497 

AACCTAATCTCACAATGCTCCCCCCACAATCCACCCTCCACACTCACAOCATTGAGAATTAAGAAGACGCTTGCCAACATGGTTGAATCCCAAGATCTAAAAATCTCATCCITAAACCAT 1920 

KLHSECWAHMPASRLTALR I K K T L A K n ^ £ S | Q 0 V K I • 532 

CCGACCACAAACTCTACACTCCAACAACTGTTTTTACCCATCCCATCCCTCCAATTACACTGCAATAAGCATCTTAACTTCCTTCTCAGACTC TTTC TTC ACT ACCTGTTC AC ACCCTGC 2W0 

TAATAnAAACCTnCACTACTCTTATTACCATACAAOCTCCCAACTTCTAAACACTTCATTCTTTATATATGCACACCTTTATTT-rAAATCTGCTTTTTCATCCCTTTTTTTAACTGCG 2160 

TTTTTATCAaCTCCATC AACACTTC AATCCTCATT ACTCTC TCC ACTCAACCTCT6CGT AC TfiAATTCCCTC TTC A TAAAACCCTCC TTTCTCTCAAACCCTT AAGAACATAAATCACCC 2280 

CACCACACATCGACAAATAGACTTTCCCTT7TACCTCA6ACATTCACTTCCTTTCTATTCTACCTTTCTAAAACAGCCTATACATCATCATCTCTTTCCGATACTCCTTATTTTATGATA 2400 

CTTTCTCCTCTCTCCTTACTCATCTGTCTCTCTCTCCATCCACATCCACCCCCCGATTCCTCTGCTCCCATTTCAATTACAACAAAAIAATTTATATCCATCCACACCAAGATATTCGTG 2520 

CCCCCTSCTTTTCTGCTTTAAAAATGCAATATCTGACCAACATTCCCCAATCTCATACAACCCATTTACTTTGCAACTGAGATAGCTTCCCCACCACCTTTATTTTTTAACATGAAAGCT 2640 

CATCCCAACGCCAAAAGAACTTTAAACCATCTGTAAATTTCCACTCTTTTCCTTCAACCACCATTTTTTTTGTCCTTATTATTTTTCTCACCCAAACCATCCTCTCCAAAGTTCCACCTT 2760 

CTATTGCCATCAACCATCCTTACAAAGAAAGCACTTCT7ATTGAACTCAATTCCTCCATTTCATACCAATCTAAGTCCCTATAACCATCTTCTATATTC TTTATTCTCAGTAACTTTTAA 2880 

AAGGCAACTTATTTATATTTTCTCTAT AATCTGCTTTATTTGCAAATCACCC 2932 



' IACP * F l-LFt.l ' ' <VFLVINTHOR¥TMHft Q( , L ^ 
CLSl<0,{ ' LOOL,fTOl -SISCSCSCLPLFVQ Rl 8C ' 

v LL _o_^, iCKcurccvwRCRwRcc'o v a , < i; s s r £ E , 

Tcn c ; T ; cAC R cc ^ ^_ 

° T " " L " H E " 1 <- 6 F I A A 0 N K 0 K C T W r 0 L w u » s I60 

c ; ci ; fc AT c A cc ^ ^ 

L " " T T " ' 1 E C " ' K l- A L S A A S C L A H L H « E , ?co 

C ^ C "«"«^ 7*) 

CTCACTCACACCAyCACATTCCCCCCAATCA C A C «t C «CACCAAACCAtACArC C CCCCTCAACTACTTCATCAAACCATTAA,ArCA^ 8.0 

T , " * " e » «• o e t i h „ t H r o s f t c a o 290 

T^^ A^^^^C^^^'*^*^^^''^*'**^^^^^^'***^*'^**^^^^''**'^'*^"^^^*^'***^**^*^^*^ TCCCATATT ACCAC ITACtCCCC TC TCACCC T ICC AT TSACIA A 9*0 

IARRCNSCC VHCETQLPT TOLVPSOPS 1 t ' 320 

Arcc^AccTrcTArcTCArcAOAAccrcccTcccA^^ ,o 8 o 

GCCCCCCrCACCCCCCTCC^^ ^ 

C «A^^ ■ « 

C ""*« T =«C«"«*"CACAA C T^^ 6 " 
CCCCTTTCTCTCICCACCCSTCTCTCCATCTCCCCACCICCCTCCCCCCITCTCCCTCCTTCCTCCCATCCCCTTACACCTCCCICTCACTCTCICTCrCT^ 0^ 

1967 
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